Reverse engineering of linking preferences from network restructuring 
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We provide a method to deduce the preferences governing the restructuring dynamics of a network from the 
observed rewiring of the edges. Our approach is apphcable for systems in which the preferences can be formu- 
lated in terms of a single-vertex energy function with f(k) being the contribution of a node of degree k to the 
total energy, and the dynamics obeys the detailed balance. The method is first tested by Monte-Carlo simula- 
tions of restructuring graphs with known energies, then it is used to study variations of real network systems 
ranging from the co-authorship network of scientific publications to the asset graphs of the New York Stock 
Exchange. The empirical energies obtained from the restructuring can be described by a universal function 
f{k) ~ ~k In k, which is consistent with and justifies the validity of the preferential attachment rule proposed 
for growing networks. 

PACS numbers: 89.75.Hc,02.70.Rr,05.70.-a,05.90.+m 



I. INTRODUCTION 

In the past few years, the analysis of the network struc- 
ture of complex systems has become a rapidly expanding in- 
terdisciplinary field (n S 01 ■ Interpreting the dynamics of 
these complex networks with techniques developed in statis- 
tical mechanics represents a new approach to network theory 
jlllli0lili[l3lll- A recently introduced method is to de- 
fine equilibrium network ensembles as stationary ensembles 
of graphs generated by restructuring processes obeying de- 
tailed balance and ergodicity fllll . During such a restructuring 
process, edges are removed and/or inserted. If an energy func- 
tion of the graphs is also given, then the construction of the 
corresponding microcanonical, canonical and grand canoni- 
cal ensembles can be carried out in a similar way to classical 
statistical physics. However, unlike in many physical systems, 
the energy of a graph cannot be derived from first principles 
and does not necessarily exist. In case of graph ensembles de- 
signed to have some prescribed property, a properly defined 
cost function (or energy) can capture the deviations from this 
property. This method of defining energy is typically used in 
optimization contexts. 

For network systems, in which the restructuring is assumed 
to be governed by interactions that can be interpreted in terms 
of energy, one can try to deduce this energy using the data 
obtained from the time series of the evolution of the system. 
In the present work, our goal is to demonstrate this reverse 
engineering process in detail. In particular, we shall put our 
methods to practice for a few different network systems such 
as the co-authorship networks of scientific publications I12L 
[l?,'!?,'!?,'!?,'!?!, the network of the US film actors | i8','l?L 
and the asset graphs of the New York Stock Exchange L2Q.1 . 



serted) can be thought of as particles in a physical system and 
the vertices as the volume of the system. Since the properties 
of growing networks have already been studied in great de- 
tails, here we assume that the number of vertices, N, is fixed. 

In the statistical mechanics picture, the time evolution of 
the probability of occuiTence of the graphs is governed by a 
set of master equations | IJJ : 



dPa 

dt 



b 



PaTa-^b) , 



(1) 



where Pa is the probability of graph a and Va^b denotes the 
transition rate from graph a to graph h. If the dynamics is 
governed by an energy function, then it determines the ratio 
of the transition rates as 



-{\E^i-liAM^i)/T 



Tb- 



(2) 



In the equation above AEab = — Ea is the energy dif- 
ference between the graphs, Mab — Mb — Ma denotes the 
difference in the total number of edges, /i is the chemical po- 
tential associated with the appearance of extra edges in the 
system and T denotes the temperature, corresponding to the 
level of noise. If M is kept constant, then in the T ^ oo limit 
the dynamics converges to a totally random rewiring process, 
and thus, the ensemble of classical random graphs 1 21] is re- 
covered. On the other hand, at low temperatures the graphs 
with lowest energy (meeting the requirements set up by the 
preferences) are selected with enhanced probability. 

As the transition rates obey detailed balance if the dynam- 
ics is ergodic, the probability distribution will converge to a 
stationary distribution. 



II. RESTRUCTURING PROCESSES AND PREFERENCES 

In the following we assume that the preference of establish- 
ing a given link can be expressed in terms of a change in the 
network's total energy. During this graph restructuring pro- 
cess, the edges (which can be deleted, relocated or newly in- 



III. SINGLE-VERTEX ENERGIES 

The energy function of a network corresponds to the pref- 
erences in the rearranging of the graph structure (e.g. when 
relocating an edge from one vertex to another), if only the lo- 
cal properties of the graph influence the edge relocation, then 
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the simplest form of the energy is given by the sum of con- 
tributions from the individual vertices. Assuming that these 
contributions depend only on the degrees of the vertices, the 
total energy can be written as 



N 



E 



(3) 



where N is the total number of vertices, and ki denotes the de- 
gree of vertex i. Note that if /(fc) is shifted by a constant, the 
resulting dynamics remains unchanged since only the differ- 
ence between energies associated with different degrees may 
influence the restructuring. Therefore, without loss of gener- 
ality we may set /(O) = 0. Furthermore, the linear part of / 
is irrelevant if the number of edges is constant (since its con- 
tribution is proportional to the number of edges in the graph), 
and simply renormalizes the chemical potential if the number 
of edges is allowed to change. 

An alternative form of the single-vertex energy functions 
can be written as 



N 



(4) 



i— 1 i' 



where i' runs over all vertices that are neighbors of vertex i. In 
this interpretation, the energy of an individual vertex depends 
on the connectivities of its neighbors, and vertex i collects an 
energy g{kii) from each of its neighbors. These neighbors in 
turn will all collect g{ki) from vertex i, therefore, the total 
contribution to the energy from vertex i is kigiki). Thus, by 
choosing 



f{k) = kg{k), 



(5) 



the two alternative forms of the single-vertex energy, and 
(0), become equivalent. 

The advantage of the latter representation is that the irrele- 
vant linear part of f{k) appears as a simple additional constant 
term in g{k) for fc > 1. 



IV. REVERSE ENGINEERING WITH INDEPENDENT 
EDGE DYNAMICS: THE METHOD 

In the following we assume that the energy function of the 
investigated system falls into the class of the single-vertex en- 
ergy functions (0}. The main point of our method is that it 
assumes independent dynamics for each half edge. In this ap- 
proach, each vertex i is treated as a hedgehog, with ki half 
edges. Since the energy function depends only on the de- 
gree sequence, the energy of the network is identical to that 
of the set of isolated hedgehogs. The restructuring of the net- 
work can be approximated by the insertion and deletion of 
half edges. This approximation is valid when the number of 
vertices is large compared to the typical number of half edges 
on the individual vertices. Let us denote by rk^k-i the rate at 
which existing half edges on vertices with degree fc disappear, 
and accordingly by rk-i^k the rate at which new half edges 



appear on vertices with degree fc — 1. The ratio of these two 
rates is determined by the energy in a way similar to Eq. 0: 



^U{k)-i,k)/T 
=(/(fe-i)-A(fe-i))/T' 



(6) 



where jl — ijl/2 denotes the chemical potential for the half 
edges. From (|6|l it is clear that the effect of the chemical po- 
tential on the dynamics cannot be separated from the effect of 
the linear part of /(fc), and the multiplication of /(fc) and /i 
by the same number is equivalent to the rescaling of the tem- 
perature. For these reasons it is convenient to introduce the 
dimensionless single-vertex energy 



m 



I{k)~~^k 
T 



(7) 



which incorporates the chemical potential as well. This yields 

(8) 



Tk-l^k 



The number of disappearing edges on vertices with degree 
fc per unit time can be written as 



Ik~>k-i = rk^k-iPkNk, 



(9) 



because there are pkN vertices with fc edges and these have a 
total of pkNk edges on them. Similarly, the current between 
the same two states in the opposite direction is 



h-i^k = rk-i^kPk-iN{N - 1), 



(10) 



because there are pk-iN vertices with fc — 1 edges and the 
appearing new half edge can be placed at — 1 different po- 
sitions (corresponding to the A^ — 1 possible neighbors of a 
given vertex). For simple graphs (i.e., when only zero or one 
connection is allowed between two vertices) the number of 
these possible positions is A^ — 1 — (fc — 1) and accordingly 
the (A^ — 1) factor in expression ilQ\ should be replaced by 
(A^ — fc). However, in all cases studied in this paper we have 
A^ ^ fc, and therefore we may use 



Ik-i-*k = rk-i^kPk~iN 



(11) 



in all situations. From the ratio of the currents Jl 1> and (|9} 
and Eq. (|8j we get 



„0(fc)-0(fc-i) 



h^k-i Pk-i N 
h-i^k Pk k ' 



(12) 



The quantities appearing on the right hand side can be ob- 
tained from the available data and the currents can be com- 
puted as follows. At each time step each vertex should be 
compared to its own state in the next time step. Those links 
that appear only in the latter graph contribute a current of 
amount 1 to Ik^k+i, where fc is the degree of the vertex in 
the former state, whereas those links that are deleted in the 
time step contribute to Ik—,k-i- If ^ new vertex appears in the 
second graph, each of its links will contribute to /o^i with 
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: We illustrate the measurement of the currents Ik^k+i and 
1 in the dataset by comparing two subsequent states of ver- 



FIG. 1 

tex in a hypothetical simple graph with 17 vertices. (Only edges 
connected to vertex are shown). In the initial state (left side), the 
central vertex is connected to vertices 1, 2, 4, 5 with one edge and to 
vertex 3 with two edges, hence its degree is fc = 6. In the second 
state (right side), the link to vertex 2 and one of the links to vertex 
3 disappeared giving a contribution /6->5 = 2 to the currents. Simi- 
larly, new links to vertices 6, 12 and 13 appeared, thus /e^? = 3. 



a current of amount 1. Similarly, links of disappearing ver- 
tices of degree k will contribute to Ik~tk-i- A simple exam- 
ple demonstrating the evaluation of the currents in the frame 
of independent edge dynamics is shown in Fig. [2 

Since we set /(O) = 0, the corresponding 0(fc) will also be 
zero for fc = 0. For fc > 1, we can express (j){k) using the 
recursion relation (I12> as 



1=1 



h^i-i pi-i N 



h- 



1^1 



Pi I 



(13) 



In the alternative representation of the energy defined in equa- 
tions and (|5jl, one can introduce 



7(fc) 



Hk) ^ gjk) - p 

k T ■ 



(14) 



V. CASE STUDIES 

We first tested our method on known energy functions used 
in Monte Carlo simulations, then applied it to study real world 
systems ranging from the co-authorship networks of scientific 
publications to the asset graphs of the New York Stock Ex- 
change. In each case, the statistics of vertices with a given 
degree became poor with increasing fc, therefore only the in- 
terval of the well represented degrees was taken into account 
in our reverse engineering process. The upper boundary of 
this interval, fc,„ax, was set to the point where the time aver- 
age of the number of vertices with degree fc became smaller 
than 0.1% of the average number of vertices. 

As explained in the previous sections, when the total num- 
ber of edges, M, is fixed, the linear part of /(fc) is iiTelevant. 
When M is allowed to vary, a linear function, —fik, naturally 
appears in Eq. and cannot be uniquely separated from 
the energy. Therefore, we will focus on the non-linear part 
of the empirically obtained (/)(fc), corresponding to the non- 
linear part of the single-vertex energy. In general, the (j){k) 



obtained from the data can be decomposed into 

0(fc) =Co + Cifc + 0*(fc), 



(15) 



where (j>*{k) denotes the non-linear part of 4>{k). The addi- 
tional Co constant shift is introduced to account for the ir- 
regular behavior of the energy of vertices with zero degrees in 
the collaboration networks (the networks of the scientific pub- 
lications and the US movie actors). In these systems 0(fc) is 
relatively smooth for fc > 1 in contrast to a gap in the energy 
observed between fc = and fc = 1. Accordingly, fc = has 
been excluded when fitting the empirical results, and the shift 
of the fc > 1 part caused by the gap is taken into account with 
the help of Cq. The j{k) corresponding to il5i can be written 



7(fc) -Co/fc + Ci+7*(fc), 



(16) 



where 7*(fc) ~ (j)*{k)/k. In the expression above, the first 
term decays for high fc and the second is a constant, therefore, 
in practical cases the remaining non-linear part is somewhat 
more apparent than in the 0(fc) representation. 

In the figures showing our results, we shall plot (j){k) — Cifc 
and 7(fc) — Co/fc — Ci, since this way both the non-linear 
part of the energy and the gap at fc = for the collaboration 
networks become apparent. 



A. Testing with known energy functions 

We first tested our method on known energy functions used 
in Monte Carlo simulations with a constant (N = 10000) 
number of vertices and (fc) = 2. The most trivial choice for 
the energy function was the case when no energy was present 
in the rewiring and the edges were replaced randomly. In 
this case the network converges to a classical random graph 
regardless of the initial state, and accordingly the maximum 
degree taken into account in the reverse engineering process 
was fcmax = 7. The other tested energy was chosen to be 
/(fc) — — fcln(fc). This choice was motivated by our previ- 
ous studies which showed that in this system, around T = 1 a 
scale-free degree distribution can be observed 1 10|, hence the 
interval of fc taken into account in the reverse engineering is 
wider (fcmax — 27) than in case of the random rewiring. 

Fig. El shows our results for the two test energies. In both 
cases, Co was set to zero, and Ci was set equal to (p{k = 1), 
(since both energies used in the MC simulations give /(I) = 
0). In both situations, the resulting empirical 0*(fc) could be 
fitted well with the energy used in the MC simulation. 



B. Collaboration networks 

The collaboration networks investigated in this paper can be 
listed as follows: the co-authorship networks of neuroscience 
and math publications, the co-authorship networks of the Los 
Alamos e-print archives of Condensed Matter, High Energy 
Physics - Phenomenology and Astrophysics, and the network 
of the US movie actors. These networks are all bipartite: the 
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FIG. 2: 4>{k) obtained from the restructuring data of the MC simula- 
tion with C\k subtracted, together with fits. For the case of iJ = 0, 
(random rewiring, circles), the system converges to a classical ran- 
dom graph, which results in a narrow degree distribution. In the sys- 
tem with E = — fc In(fc) (crosses), the interval in which 0(fc) can be 
obtained is wider. Thus the results from reverse engineering are in 
good agreement with the initial MC energies. 



FIG. 3: The empirical (j>{k) obtained for the neuroscience publica- 
tion network (circles) for r = 3. In order to make the non-linear part 
of (j>{k) more apparent, the linear part of the fit, Cik was subtracted 
from the data. The actual values obtained for the fitting parameters 
were Co = —6.7, Ci = 12.6, a = 0.96. A relatively large gap be- 
tween 4'{k) at fc = and k = 1 can be observed. The inset shows 
the corresponding 7(fc) with Co/k + Ci subtracted from the data 
(boxes) together with — 51n(fc) (dashed line). 



collaborators (scientists, actors) and the collaborations (pub- 
lications, movies) can be represented by two different kinds 
of vertices, and the links in the system are always placed be- 
tween vertices of different types (a scientist is connected to 
the papers in which he/she is a coauthor). Uni-partite projec- 
tions of these graphs can be made by keeping only one kind of 
the vertices and placing a Unk between them if they are linked 
to a common vertex of the other type in the bipartite graph. In 
the cases studied in this paper, the projection onto the graphs 
of collaborators has been used: the nodes represent collabo- 
rators (scientists or movie actors) and two nodes are linked if 
they have a common collaboration (articles, movies). In the 
graphs obtained this way, both edges and vertices are distin- 
guishable and multiple links between vertices are allowed. 

The collaboration network databases we have used ll22ll23ll 
provide the time of the emergence of the links, but the times 
when they disappear are not defined. In other words, the time 
we consider an edge to be alive after its birth influences the 
structure of the graphs representing the system year after year 
This parameter will be denoted by r, the life-time of an edge. 
The minimal value for r is equal to the time resolution of the 
dataset. In case of r = cx), the system is simply growing and 
no edges are deleted. Evidently, the choice of t may influence 
all statistical properties of the network. 

The number of vertices in these systems is changing slowly 
with time, but the number of edge restructuring events per 
time step is larger by two orders of magnitude than the change 
in the number of vertices. Therefore, the effect of the increase 
or decrease in N can be neglected, and we use the time aver- 
age of the right hand side of Eq. (I12> (derived for a fixed N) 
to calculate 

The most important statistical properties of the studied 
networks are summarized in Table |I] 



1. Neuroscience- and math publications 

Data on these two systems was obtained from Ref. i22ll . 
The time resolution of the appearance of the new articles was 
one year in the dataset. The upper boundary of the interval 
of k taken into account in the reverse engineering was in the 
range of 20 — 40. The results in case of the neuroscience 
papers for t — "i years are shown in Fig. |3l and very similar 
plots were obtained for the math publications. In both cases, 
(/)*(fc) and 7*(fc) can be fitted with 

</>*(fc) = -5A:ln(fc), 7*(fc) = -51n(fc) (17) 

where 3 is a positive constant. The actual value of this con- 
stant was 5 — 0.96 for the neuroscience publications and 
a = 0.99 in case of the math papers. 

In both cases, ip{k — 0) is separated from the fc > 1 part by 
a gap, indicating that zero connection is unfavorable for the 
vertices. This is consistent with the observed degree distribu- 
tions, in which the vertices of zero degree were suppressed. 
(In case of the math papers, vertices with zero degree were 
totally absent). 

For r = 1 and r = 2, similar results can be obtained with 
modified fitting parameters. 



2. The Los Alamos e-print archives 

We also studied the restructuring of the co-authorship net- 
works of three e-print archives: Condensed Matter, High En- 
ergy Physics - Phenomenology and Astrophysics |23]. In 
all three cases, the time resolution of the appearance of the 
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T — 1 year 


r = 2 year 


T — 3 year 




timespan 


timestep 


(iV) 


(k) 


^max 


a 


(iV) 


(k) 


^max 


Q 


(N) 


(fc> 


^max 


a 


movie actors 


1896-1999 


1 year 


9223 


28.04 


100 


0.56 


15350 


34.52 


130 


0.69 


20373 


39.23 


150 


0.74 


neurosci. publ. 


1991-1998 


1 year 


55057 


7.837 


35 


0.86 


87513 


9.397 


38 


1.02 


113160 


10.54 


44 


0.96 


math publ. 


1991-1999 


1 year 


17279 


2.737 


15 


0.83 


29116 


3.141 


17 


0.93 


38633 


3.45 


20 


0.99 


hep-ph 


'92.03-^04.02 


1 month 


3202 


7.14 


23 


0.41 


4386 


8.52 


30 


0.58 


5186 


8.48 


35 


0.62 


astro-ph 


'92.04-'04.02 


1 month 


5472 


10.90 


60 


0.44 


7103 


13.02 


70 


0.56 


7891 


13.98 


80 


0.59 


cond-mat 


'92.04-'04.02 


1 month 


5648 


4.16 


29 


0.73 


7464 


4.71 


34 


0.96 


8194 


4.94 


37 


0.91 



TABLE I: The most important statistical properties of the studied collaboration networks, fcmax denotes the maximal degree which occurred 
frequently enough in the restructuring process to be able to extract the energy, and 5 is the fitting parameter introduced in Eq. <17t . 
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FIG. 4: The empirical (/!>(fc) of the Astrophysics archive for 
T — 3 years with Cifc subtracted (circles) plotted together with 
— 5fcln(fe) + Co (solid line). The values of the fitting parameters 
read Co = -4.8, Ci = 8.4, 5 = 0.59. The 4>{k = 0) is separated 
from the k > 1 part by a gap, similarly to the case shown in Fig|3| 
In the inset the corresponding 7(fc) — Co/k — Ci is shown together 
with -0.59 In(fc). 
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FIG. 5: (t){k) obtained in case of the US movie actors network for 
r = 3 with Cifc subtracted from the data (circles). The resulting 
functions can be fitted with —ak In(fc) + Co similarly to previously 
studied collaboration networks. The actual values of the fitting pa- 
rameters are Co = —14, Ci = 10.4, a = 0.74. The gap separating 
(j){k = 0) from the fc > 1 part, characteristic for the other collabora- 
tion networks can be seen in this case as well. The "/{k) correspond- 
ing to this is shown in the inset with Co/k + Ci subtracted from 
the data, plotted together with —a In(fc). 



new manuscripts was one month. To be able to compare the 
properties of these systems to the previously investigated co- 
authorship networks, we set r = 1,2,3 years in these cases 
as well. From the point of view of the energy,the obtained 
behavior was very similar to the previous cases. For the Con- 
densed Matter, High Energy Physics - Phenomenology net- 
works, fcniax was in the range of that seen for the neuroscience 
publications, and for the Astrophysics archive it was some- 
what larger (fcmax = 80). The non-linear part of the empirical 
energies could be fitted with functions of the form given in 
(I17> . The value of the parameter a in case of t = 3 years 
was a = 1.05, a = 0.77, a = 0.59 for the Condensed Mat- 
ter, High Energy Physics - Phenomenology and Astrophysics 
archives respectively. 

The results for the Astrophysics archive are shown in 
Fig 13 The gap at A; — in the energy together with the 
suppression of the vertices with zero degrees seen in case of 
the neuroscience- and math publications is also present in all 
three systems. 



3. The network of movie actors 

In the data of the US movie actor network ll22il . the time 
resolution of the appearance of new films was one year For 
this network, the allowed k interval in the reverse engineering 
is reasonably wide. Similarly to the co-authorship networks 
studied in the previous paragraphs, the non-linear part of the 
empirical energy can be fitted with functions of the form given 
in ( I17K and a gap at fc = together with the suppression of 
the vertices with zero degrees can be observed. Our results for 
T — 3 years are shown in Fig|5lwith a = 0.74. 

C. Asset graphs of the New York Stock Exchange 

In this study, we used the asset graph sequence received 
from 1 24]. These graphs were constructed from correlations 
of the 477 NYSE traded stocks: in a given time window the 
476 most strongly correlated pairs of papers were linked L20il . 
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FIG. 6: The empirical <?!)(fc) of the asset graphs of the New York 
Stock Exchange with Cifc subtracted from the data (circles), plotted 
together with — (5fcln(fc) (solid line). The fitting parameters were 
Co = 0, Ci = 8.5,(5 = 1.7. The inset shows 7(fc) ~ Ci plotted 
together with — 5 In(fc). 



Therefore in this case the obtained graphs were simple graphs. 
The market data was recorded with a time window width of 
1000 days, from 02-Jan-1980 to 31 -Dec- 1999, and the win- 
dow was stepped monthly. This resulted in 195 time windows. 
In this case, the general rule used to determine fcmax would re- 
sult in accepting such degrees also that are represented by less 
than one vertex on average, therefore fcmax was set equal to the 
highest degree represented by at least one vertex on average, 
yielding fc^ax = 20. 

Similarly to the collaboration networks, the non-linear part 
of the empirical energy can be fitted using Mil with a — 1.7, 
as shown in Fig. |6l For this system, the gap in the energy at 
/c = seen in the case of the collaboration networks is absent, 
and accordingly the Cq fitting parameter can be set to zero. 

VI. DISCUSSION 

For each studied natural network, the non-linear part of the 
empirical single-vertex energy can be fitted with the same 
universal function given in Eq. Mil . We note that (due to 
the relatively narrow intervals in k) power-laws of the form 
E = — B /cf can be used for fitting equally well, with 
fitting parameters B and j3. However, the following two rea- 
sons make the logarithmic fit, E — — 5 li^(^j)' f^i" more 
favorable: first, this latter has fewer fitting parameters; and 
second, since in each example a has been found to be very 
close to unity (within a factor of two), even a is not a real 
fitting parameter. 

The logarithmic fit is also consistent with the rule of prefer- 
ential attachment observed in growing networks llq. IZSl l2al . 
This can be shown by estimating the energy change of a vertex 
by the derivative of (f)*{k) + ak — —ak{ln{k) — 1), yielding 
AE/T — — 51n(fc) (the iiTelevant linear term ak has been 
added to 4>*{k) for simplicity). Plugging this into the Boltz- 
mann factor, exp[—AE/T], the resulting acception/rejection 
ratio for a randomly selected move is proportional to k". In 



case of 5 = 1 we recover the linear preferential attachment 
rule. For several growing networks, however, the preferential 
attachment was found to scale as /c" 03 US- The values for 
a measured by Jeong et al.fT^ are close to those we obtained 
for a. Hence our results provide a novel justification of the 
preferential attachment rule. 

Furthermore, in the second representation of the energy, in 
which vertices contribute to the total energy depending on the 
degrees of their neighbors, the energy contribution of vertex i 
becomes ^ 7* (/ci') = —5^,-, ln(fci/), where i' runs over the 
neighbors of vertex i. This can be interpreted as follows: the 
vertices "feel" advantageous to be linked to other vertices of 
high degree, hence the magnitude of a neighbor's contribution 
to the energy of vertex « is a monotonously increasing function 
of that neighbor's degree. The distinction between the neigh- 
bors in this picture follows the general Weber-Fechner law of 
sensation: a stimulus of a given degree generates a percep- 
tion proportional to the logarithm of that degree in the neigh- 
bors. This logarithmic law of perception is natural in cases 
where the intensity of the stimulus may vary over several or- 
ders of magnitude, which is the case for degrees in scale-free 
networks. 

Another situation where the use of g{k) is very convenient 
is when it is advantageous to reach many other vertices via 
short paths, but it is expensive to have lots of connections. 
Under such circumstances the best strategy is to connect to 
a few other vertices with high degrees: numerous vertices 
can be reached in two steps at the cost of few own links. If 
the number of total edges is allowed to vary, such a scenario 
could simply be modeled by a decreasing g{k) function (such 
as — ln(A:)) together with a negative chemical potential (cor- 
responding to a linear penalty for the own links of a vertex). 



VII. SUMMARY 

We developed a reverse engineering method to deduce the 
preferences governing the restructuring in non-growing net- 
works from the statistics of the observed relocations. Our 
approach is applicable to systems where the preferences can 
be interpreted by single-vertex energy functions and the dy- 
namics obeys detailed balance. The method was first tested 
on Monte-Carlo simulations ran with known energy functions 
yielding reassuring results. Real networks such as the co- 
authorship network of scientific publications, the network of 
the US movie actors and the asset graphs of the New York 
Stock Exchange were also studied. 

In each case, the non-Hnear part of the single-vertex energy 
function could be fitted with the same universal function E = 
—a J2i ki In(fci) with 0.5 < a < 2. This energy is shown to 
be consistent with the preferential attachment rule of growing 
networks. 

Furthermore, the obtained energy can be alternatively writ- 
ten as E = — ln(fc,i/), where the summation for 
i' runs over the neighbors of vertex i. In this representation 
the energy contributions of the vertices can be interpreted by 
the general Weber-Fechner law of sensation: the stimulus of 
a given vertex generates a perception in its neighbor propor- 
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